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Q . ABSTRACT 

c3 ■ 

• ' The statistics of sea state duration (persistence) have been found to be de- 

Yi I pendent upon the recording interval At. Such behavior can be explained 

• ^H . as a consequence of the fact that the graph of a time series of an environ- 

s' ' mental parameter such as the significant wave height has an irregular, "frac- 

^H i tal" geometry. The mean duration, t can have a power-law dependence on 

f^ . At as At — >■ 0, with an exponent equal to the fractal dimension of the level 

sets of the time series graph. This recording interval dependence means that 
the mean duration is not a well defined quantity to use for marine opera- 
Ol . tional purposes. A more practical quantity may be the "useful mean dura- 

]^ ' tion",T^, estimated from the formula {J2 '^i)/i'l2 Ti), where each interval 

^1 , [ti, ti + Ti] satisfying the appropriate criterion is weighted by its duration. 

These results are illustrated using wave data from the Frigg gas field in the 
North Sea. 
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INTRODUCTION 



The duration or persistence statistics of sea state and other environmental parameters are impor- 
tant for purposes such as marine engineering operations, in which, for example, useful work can 
1 J - only be performed if the significant wave height h is less than a particular value ho. Over 

r> ' recent decades a number of observational and theoretical studies have been made, in order 

>3 . to relate the statistical behavior of the duration of various sea state criteria to, for example, 

the probability distribution of the wave height, its seasonal variation, and other parameters 
(e.g. HoumbandVik, 1975; Graham, 1982; Mathiesen , 1994; Tsekos and Anastasiou , 1996; 
[Soukissian and Theochari, 2001). In this paper we shall consider the collection of time inter- 
vals, in which h{t) < ho: the intervals having durations n, where i = 1, 2, . . . , A'^. The mean 
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duration, r, is calculated using the formula 

r = {l/N)Y,r^. (1) 

i 

We can, of course, choose other criteria, for example h > hi, ho < h < hi, etc. 

If h{t) is differentiable with respect to t, the mean d uration is related to the probability 
distrib ution of h and dh/dt by the Rice-Kac formula fe . g . iRicei 1 1 944 iKaJ 1 1 943L iMathiesenL 
11994: 



-,^ . ^ 2Fhiho) 

fhiho) E [\dh/dt\ \h = ho]' 

where Fh is the cumulative distribution function of h, i.e., Fh{ho) is the probability that h < ho, 
fh{ho) = dFh{ho)/dho is the probability density function of h, and E [\dh/dt\ | h = ho] is 
the expectation (mean value) of the absolute value of dh/dt, given that h = ho. 

However, time series of observed environmental parameters often have a noisy, irregular 
appearance, so that measurements recorded at frequent intervals show considerable structure 
which does not appear if the measurements are recorded less frequently. In such a case, the time 
derivative is not well defined, so the Rice~Kac formula of Eq.|5| cannot be applied. Although 
this may partly be due to the effect of errors in the measurement or of sampling variability, 
the phenomenon has been recognized as a manifestation of fractal behavior, shared with such 

iihenomena as the irregularity of coastlines, the surfaces of snowflakes, and Brownian motion 
Mandelbrotl ll983h . It is a general property of such fractal objects or curves that their irregular 
shape remains even when you examine them at finer and finer scales. A fractal curve, such as a 
coastline, does not have a well defined length, and if you measure it with "rulers" of finer and 
finer length, the total length will increase without bound. In order to measure such irregular 
objects, it is helpful to generalize the concept of "dimension" to non-integer values, and this was 
done by Hausdorff and Besicovitch in the early part of the twentieth century (e.g. Hausdorfl, 
[l919J). A fractal curve, since it has an unbounded length, but does not fill a plane, has a fractal 
dimension (Hausdorff-Besicovitch dimension, or just Hausdorff dimension) greater than 1 but 
less than 2; a fr actal surface (e.g. a fracture surface) will have a fractal dimension between 2 
and 3, and so on. lFedeiHl98 B) showed, by analyzing wave data from the Norwegian continental 
shelf, that a time series of significant wave height can have a fractal behavior. More details on 
the definition and calculation of fractal dimension are given in the next section. 

If the graph of an environmental parameter h{t) has fractal behavior, it will have a fractal 
dimension between 1 and 2. If this graph crosses a horizontal straight line, it will intersect the 
line infinitely many times. If we consider a small interval around one of the intersection points, 
there will always be more intersection points within the interval, no matter how small the interval 
is. The collection of points where h{t) intersects the line h = hois called the level set of h{t) at 
ho, and has a fractal dimension between and 1. A point has dimension zero, as has a collection 
of points which are all more than a certain distance apart, but this particular collection of points 
will be so numerous and irregularly distributed that it will have a fractional dimension strictly 
greater than zero. 

Since the number of intervals satisfying h < ho is infinite, any estimate of r using values of 
h{t) sampled at successive recording intervals At will become smaller and smaller as At — > 0. 
This type of behavior was found by Soukissian and Theochari ( 2001 ) for measured time series of 
significant wave height and wave period. They derived formulas for mean duration of the form 
T — a + b\og{At/So), where a, b, and So are constants. However, this logarithmic formula is 
not valid in the limit At —> 0, since r cannot be negative. 



DURATION DISTRIBUTION 

The only possible limits of r as At — » are a finite positive value or zero (we assume here that 
the measurements take place over a finite period so that the maximum value of Ti is finite). To 
make more specific predictions of the behavior of the duration s tatistics, we shall now determine 
how they are related to the Hausdorff-Besicovitch dimension JHausdorfn. Il919ti of the above- 
mentioned level sets of fe(i). 

Firstly, we define the d- dimensional Hausdorff measure A'*(5) of a set <S (e.g. iMandelbrotl 
ll983l:lRevuz and YoAfloQll) : 

A'iS) = lim inf (^J2 ^^^) > P- < P. (3) 

where the infimum (greatest lower bound) is over all coverings of 5 by balls (solid spheres) of 

d/2 d 

radius less than or equal to p. (The quantity '^j is the d-dimensional generalization of the 

formula for the volume of a ball of radius pi.) In other words, you cover the set with as few 
balls as possible no greater than a certain size, add up their "d-dimensional" volumes, and repeat 
the process with smaller and smaller balls. If d is an integer, and 5 is a subset of d-dimensional 
Euclidean space, then the Hausdorff measure is the d-dimensional Lebesgue measure, equivalent 
to what we usually mean by length, area, volume, etc. 

If A''{S) = for d > D and A'*(5) = oo for d < D, then D is said to be the Hausdorff 
dimension of S. In this case, A^(iS) may be zero, finite, or infinite. The level sets of h{t) are 
sets of points embedded in a one-dimensional space, so the "balls" in the definition of Hausdorff 
measure may be replaced by intervals of length Si — 2pi. 

In the case of Brownian motion, w here h ( t) can be thought of as a one-dimensional random 
walk with infinitesimally small steps JLevvlll965r). the level sets hav e D = ■^. Fractional 
Brownian motion Bnit), described bv lMandelbrot and Van Nes s^ ( 1968), where H is the Hurst 
exponent, < H < 1, and with H — ^ corresponding to ordinary Brownian motion, has level 
sets with D = 1 — H. If h{t) is a random function whose values at different values of t are 
completely uncorrelated, it will have level sets with D — 1. 

The simplest method of computing the Hausdorff dimension is by the box-counting method, 
where we just divide the time domain into fixed intervals of length s. If A'^ such fixed intervals 
are required to cover the level set, then the box-counting dimension can be defined as minus the 
slope of the graph of log A^ v. log s, were we can, for example, fit a straight line to the graph of 
the data points. (Strictly speaking, for the Hausdorff and box-counting dimensions to be equal, 
we should compute the asymptotic slope for small box sizes.) We say that 

N{8) = O(0(s) ■ s-°), (4) 

where (/!>(s) is a function which varies more slowly than any power of s. For simplicity we will 
henceforth neglect the presence of (^(s), and use the notation A ~ B to mean A — 0{B). 

The distribution of the lengths Ti of intervals where the relation h < ho is satisfied depends 
on the distribution of points of the corresponding level set £, since the end points of each of the 
intervals are also points in C. If we cover £ with fixed intervals of length s, the total number 
of fixed intervals required will also be of the order of N{Ti > s), the number of intervals with 
duration greater than s. But by the definition given above of fractal dimension, the number of 
fixed intervals required is of order s^^. 

Thus we have 

N{n > s) ~ s"-°. (5) 



For < D < 1 the mean duration is then given by 

"^ s-° ds) I ( r^^^ s-^-i ds\ . (6) 



The upper integration hmit Tmax can be taken to be the total observation time T, and the lower 
limit can be taken to be the recording interval Af . We then have, for < D < 1, 

T^-°-M'-° D 



At-°-T-o 1-D' (7) 

~ T^-°At° ■ D/(l - D) since T > At. 

For D = Q, the level set C will in general have a finite number of points, so r should become 
constant for sufficiently small At. 

For D > 0, the mean duration tends to zero as At ^ 0. For marine and offshore engineering 
purposes this means that it is a parameter which is not particularly informative if we record 
observations at frequent intervals, in spite of the fact that we would expect the environmental 
data to be more useful if sampled more often. The reason for the poor behavior of r for small 
values of At is that it over-represents very short intervals which contribute very little to the 
engineering work which could, for example, be performed. If we weight each interval by its 
length Ti, we obtain what we can call the useful mean duration: 

Using Eq.|5| we obtain r" ~ (const.) • T for < D < 1, independent of At. The most useful 
way of presenting the distribution of duration of intervals of length Ti is to determine th e fraction 
of the time Fiji > s) occupied by the intervals of more than a specified length fe.e. lGrahaml 
rTgSZ) . and this also requires weighting them according to their length: 



F{Ti > r) = (l/T) E 



E- 



(l/T) / s dNiji > s). 



(9) 



(Note that the integration limits are reversed, because N(Ti > s) is a decreasing function of s. 
The quantity N(Ti < s) may be infinite.) 

It should be noted that if dh/dt in the Rice-Kac formula (Eq.|2j is replaced by its discrete 
version Ah/ At = (/i(f + At) - h{t))/At, we obtain the relation 



E 



Ah 



-1 

(10) 



At 
(e.g. |jenkinsll2002h . 

ANALYSIS OF FIELD MEASUREMENTS 

Feder's fractal wave analysis 

lFedeii h98ff) made a fractal analysis of 3 years of wave data from Troms0flaket on the Norwegian 
continental shelf. After performing a seasonal adjustment on the time series of the significant 
wave height, he calculated a cumulative sum, effectively integrating the results with respect to 
time, to obtain a record which he found to have fractal behavior with a Hurst exponent H = 



0.92, corresponding to a 'level set' fractal dimension of 0.08. The asymptotic fractal behavior 
was found for time scales between 3 hours and 10 days. However, the wave record itself is 
more irregular, in fact very discontinuous: if the cumulative sum of the wave height record has 
D — 0.08, the wave height record itself, obtained by taking differences, will have level sets 
which have a fractal dimension of rmn(l + 0.08, 1) = 1. 

Hence for any limiting wave height ho there may be periods of finite duration where the 
wave height is crossing over and under ho at virtually every recording interval. The asymptotic 
behavior of r will thus be r ~ At as At -^ 0. 

Analysis of North Sea data 

Feder's results are, however, based on measurements with the rather coarse time interval of 
3 hours, and his method of analysis, based on summing the measurements, does not lead itself 
easily to the direct determination of either the fractal dimension of the level sets, or of the calcu- 
lation of the duration statistics. In the present study we analyze in a direct fashion a wave height 
time series which has a finer, 20-minute time resolution. 

The top frame of Fig. Q shows a time series of significant wave height from the Frigg gas 
field in the North Sea, for the year of 1984, based on an analysis of 20-minute records. The time 
series was re-sampled at a range of different recording intervals, from the initial 20 minutes up 
to 14 days (336 hours). The mean duration and useful mean duration as a function of recording 
interval are plotted in the center frame. 

The dependence of the mean duration on the recording interval is fairly well approximated 
by a power-law relation, r ~ Ai"''^^. From Eq. Qwe would therefore expect the Hausdorff 
or the box-counting dimension of the level set of /i = /io = 5m to be 0.676. However, the 
straight-line fit by least-squares regression in the lower frame of the figure shows, according to 
Eq.|4] that the box-counting dimension is D = 0.474. Similar values of the exponent of the 
power-law dependence of mean duration on the recording interval, and box-counting dimension 
of the corresponding level set, are obtained for other values of ho- The alternative formula of 
Eg . 11 01 based on the Rice-Kac relation, relating the mean duration to \Ah/At\, was also applied 
to the same data set bv Jenkins. 12002) . and an intermediate exponent of 0.57 was obtained. 

The useful mean duration t" also varies quite considerably with the recording interval. 
There are a number of large jumps in the graph, which indicates that what is happening is that 
successive periods of h < ho are being amalgamated. Nevertheless, the variation of r" with 
At is less rapid than that of the mean duration, and its value for small values of At is about 
150 hours (6 days), which may be a more reasonable estimate of a typical "h < 5 m" period than 
the corresponding mean duration, which is approximately 25 hours. 

Figure |2| shows the seasonal variation of the corresponding statistics for the ho — 2.5 m 
level, using four consecutive three-month subsets of the data. Both the mean duration and the 
useful mean duration for /i < 2.5 m are, as would be expected, greater in the summer than in 
the winter months. The discrepancy between the calculated box-counting dimension and the 
exponent in the mean duration's power-law dependence are also present. The graph of useful 
mean duration is noticeably flatter in the winter months than in the summer months. 

This discrepancy between the calculated box-counting dimension and the exponent in the 
mean duration's power-law dependence also occurs for simulated data, as shown in Fig.|3| The 
simulated data are generated by firstly constructing an approximation to Brownian motion B{t) 
by generating a Gaussian random walk with time steps of 20 minutes, and then converting the 
results to strictly positive values by applying an exponential function, so we obtain a sample 
curve of exp(_B(t)). The Hausdorff dimension of the level sets of this graph should be D = 0.5, 
close to the calculated box-counting dimension of the Frigg level set, but the computed straight- 
line regression value (for h = 2.0) is here 0.379, indicating that further investigation is necessary 
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Fig. 1 Top: Time series of significant wave height (h — /f 1/3) from the Frigg gas field during 
1984. Center: Mean duration and useful mean duration for intervals in the dataset with h < 5m. 
Bottom: Number of fixed intervals necessary to cover the level set of /i = 5 m, as a function of 
the interval length. 



into what algorithm is suitable for calculating Hausdorff dimension. The exponent of the power- 
law dependence of the mean duration is 0.459, rather greater than the calculated box-counting 
dimension. 

lOorski ( 2001 ) has a useful discussion of the limitations of the box-counting technique, and 
some suggestions for improvement, for example by only counting points separated by more than 
the box (fixed interval) size. Strictly speaking, we should estimate the asymptotic slopes of the 
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Fig. 2 Upper graphs: Mean duration and useful mean duration for intervals in the Frigg dataset 
with h < 2.5 m, for successive three-month periods. Lower graphs: Number of fixed intervals 
necessary to cover the corresponding h — 2.5 m level sets. 



curves in the log-log plots, by only considering recording intervals below a given cut-off value. 
However, given the limited amount of data available, it is difficult to determine what a suitable 
cut-off value should be. 
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Fig. 3 Top: Simulated time series (exponentiated Gaussian random walk). Center: Mean 
duration and useful mean duration for intervals in the simulated dataset with h < 2.0. Bottom: 
Number of fixed intervals necessary to cover the level set of h — 2.0, as a function of the interval 
length. 



The graph of the useful mean duration for the simulated data is much flatter than for the field 
data, so that for small values of At the ratio of r" to r is much larger This may be due to the 
fact that the graph of the simulated data has by definition zero sampling variability, and so no 
"random excursions" will occur which may tend to split up otherwise continuous periods with 
h < ho- 



CONCLUSION 

The theoretical analysis presented here shows that duration statistics of ocean waves and other 
environmental parameters whose time dependence has an irregular, noisy behavior, can be related 
to the fractal characteristics of the relevant time series. In particular, the asymptotic behavior of 
the so-called "mean duration" of, for example, intervals where the wave height is below a certain 
level, is in general strongly dependent on the recording interval of the observations, and may 
have a power-law dependence with an exponent equal to the Hausdorff-Besicovitch dimension 
or the box-counting dimension of the associated level set. 

Testing this hypothesis using a one-year data set of significant wave height data from the 
Frigg gas field, sampled at 20-minute intervals, and then re-sampled at different recording inter- 
vals up to a maximum of 14 days, indicates that there does appear to be power-law dependence 
of the mean duration on the recording interval, but that the exponent may not be the same as 
the box-counting dimension of the corresponding level set. A similar discrepancy is found if 
simulated data are used (an exponential function applied to a random walk), and more detailed 
studies are required to uncover the reason for this. 

Instead of using the mean duration to characterize the duration statistics, an alternative useful 
mean duration is proposed, weighting each interval with its actual duration. This is more mean- 
ingful in terms of, for example, the amount of work which can be performed during maritime 
engineering operations. Estimates of this parameter are found to be somewhat less sensitive to 
changes in the recording interval than the mean duration, and have an appropriate seasonal vari- 
ation. However, the useful mean duration still does vary quite considerably with recording inter- 
val, and this is probably due to both the inevitable effect of changes in the detectability of wave 
height variations, and the effect of sampling variability. The fact that the useful mean duration 
varies much more slowly when simulated data are used does indicate that sampling variability 
is a contributory factor. However, further studies using other types of simulated time-series are 
necessary before any definite conclusion can be drawn. 

The theory of excursions of continuous-in-time stochastic processes, and the intensity (prob- 
ability per unit time) of transition from e.g. the state h < ho to the state h > ho shoul d also give 
useful resu lts for engineer ing purposes. Suitable references to the theory include .Revuz and Yon 
il99ll) and lMichnal il999f) . The relationship between excursion theory and the results described 
in this paper is a topic for further investigation. 
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